Canonical Autocorrelation Analysis

Authors

  • Maria De-Arteaga
  • Artur Dubrawski
  • Peter Huggins
Abstract

We present an extension of sparse Canonical Correlation Analysis (CCA) designed for finding multiple-to-multiple linear correlations within a single set of variables. Unlike CCA, which finds correlations between two sets of data where the rows are matched exactly but the columns represent separate sets of variables, the method proposed here, Canonical Autocorrelation Analysis (CAA), finds multivariate correlations within just one set of variables. This can be useful when we look for hidden parsimonious structures in data, each involving only a small subset of all features. In addition, the discovered correlations are highly interpretable, as they are formed by pairs of sparse linear combinations of the original features. We show how CAA can be used as a tool for anomaly detection when anomalous data do not follow the expected structure of correlations. We illustrate the utility of CAA in two application domains where single-class and unsupervised learning of correlation structures are particularly relevant: breast cancer diagnosis and radiation threat detection. When applied to the Wisconsin Breast Cancer data, single-class CAA is competitive with supervised methods used in the literature. On the radiation threat detection task, unsupervised CAA performs significantly better than an unsupervised alternative prevalent in the domain, while providing valuable additional insights for threat analy-


Similar resources

A Canonical Correlation Approach to Blind Source Separation

A method based on canonical correlation analysis (CCA) for solving the blind source separation (BSS) problem is presented. In contrast to independent component analysis (ICA), the proposed method utilises the autocorrelation in the source signals. This makes the BSS problem easier to solve than if only the statistical distribution of the sample values is considered. Experiments show that the method is mu...


Exploratory fMRI analysis by autocorrelation maximization.

A novel and computationally efficient method for exploratory analysis of functional MRI data is presented. The basic idea is to reveal underlying components in the fMRI data that have maximum autocorrelation. The tool for accomplishing this task is Canonical Correlation Analysis. The relation to Principal Component Analysis and Independent Component Analysis is discussed and the performance of ...


Spatio-temporal Methods in Climatology

Glossary. AR(1): Autoregressive model of order one. The present state of a system can be described as a linear function of the state at the previous time, plus time-independent noise. Data assimilation: Method of optimally combining irregularly spaced observations with dynamical constraints to produce dynamically consistent fields on regular grids. CCA: Canonical Correlation Analysis. EOF: Empiric...


Classification of Normal Sequences

Base sequences BS(m, n) are quadruples (A; B; C; D) of {±1}-sequences, with A and B of length m and C and D of length n, such that the sum of their nonperiodic autocorrelation functions is a δ-function. Normal sequences NS(n) are base sequences (A; B; C; D) ∈ BS(n, n) such that A = B. We introduce a definition of equivalence for normal sequences NS(n) and construct a canonical form. By using this canonical form,...


23 Multivariate Analysis Techniques in Environmental Science Mohammad

Environmental data are characterized by a large number of variables and complex relationships among them. Different statistical methods exist to reduce the number of variables. Multivariate statistics is used extensively in environmental science. It helps ecologists discover structure and provides a relatively objective summary of the primary features of the data for easier comprehension. However,...



Journal title:
  • CoRR

Volume: abs/1511.06419

Publication date: 2015